Swarm Based Features Selection for Text Summarization
نویسندگان
چکیده
The features are the main entries in text summarization. Treating all features equally causes poor summary generation. In this paper, we investigate the effect of the feature structure on the features selection using particle swarm optimization. The particle swarm optimization is trained using DUC 2002 data to learn the weight of each feature. The features used are different in terms of the structure, where some features were formed as combination of more than one feature while others as simple or individual feature. Therefore the determining of the effectiveness of each type of features could lead to mechanism to differentiate between the features having high importance and those having low importance. We assume that the combined features have higher priority of getting selection more than the simple features. In each iteration, the particle swarm optimization selects some features, then corresponding weights of those features are used to score the sentences and the top ranking sentences are selected as summary. The selected features of each best summary are used in calculation of the final features weights. The experimental results shown that the simple features are less effective than the combined features.
منابع مشابه
PSO-Based Feature Selection for Arabic Text Summarization
Feature-based approaches play an important role and are widely applied in extractive summarization. In this paper, we use particle swarm optimization (PSO) to evaluate the effectiveness of different state-of-the-art features used to summarize Arabic text. The PSO is trained on the Essex Arabic summaries corpus data to determine the best particle that represents the most appropriate simple/combi...
متن کاملImproving the Operation of Text Categorization Systems with Selecting Proper Features Based on PSO-LA
With the explosive growth in amount of information, it is highly required to utilize tools and methods in order to search, filter and manage resources. One of the major problems in text classification relates to the high dimensional feature spaces. Therefore, the main goal of text classification is to reduce the dimensionality of features space. There are many feature selection methods. However...
متن کاملIntegrating of the Diversity and Swarm Based Methods for Text Summarization
Automatic text summarization is a technique concerning the creation of a compressed form for single document or multidocuments. The summary creation under the condition of the redundancy and the summary length limitation is a challenge problem. The automatic text summarization system which is built based on exploiting of the advantages of different resources in form of an integration model coul...
متن کاملParticle Swarm Optimization Based Feature Selection and Summarization of Customer Reviews
The steady growth of e-commerce has led to a significantly large number of reviews for a product or service. This gives useful information to the users to take an informed decision on whether to acquire a service and/or product or not. Opinion mining techniques are used to automatically process customer reviews for extracting feature and opinion in a concise summary form. Existing feature based...
متن کاملEXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...
متن کامل